Back-stepping, hidden substeps, and conditional dwell times in molecular motors 
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Processive molecular motors take more-or-less uniformly sized steps, along spatially periodic 
tracks, mostly forwards but increasingly backwards under loads. Experimentally, the major steps 
can be resolved clearly within the noise but one knows biochemically that one or more mechanochem- 
ical substeps remain hidden in each enzymatic cycle. In order to properly interpret experimental 
data for back/forward step ratios, mean conditional step-to-step dwell times, etc., a first-passage 
analysis has been developed that takes account of hidden substeps in iV-state sequential models. 
The explicit, general results differ significantly from previous treatments that identify the observed 
steps with complete mechanochemical cycles; e.g., the mean dwell times r+ and r_ prior to forward 
and back steps, respectively, are normally unequal although the dwell times r++ and r__ between 
successive forward and back steps are equal. Illustrative (N = 2)-state examples display a wide 
range of behavior. The formulation extends to the case of two or more detectable transitions in a 
multistate cycle with hidden substeps. 

PACS numbers: 05.20.Dd, 05.40.-a, 87.16.Nn, 82.37.-j, 82.37.Np 



I. INTRODUCTION 



embodied in the following basic sequential model 
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Processive motor proteins or molecular motors [j], 0] 
(such as kinesin, cytoplasmic dynein, and myosin V) 
"walk" along molecular tracks (microtubules and actin 
filaments) taking observed mechanical steps of well de- 
fined (mean) spacing d , each step being of "negligi- 
bly short" (< 100 fis) duration relative to the mean 
time(s) between steps that are of order 1 to 20 ms 

a a 1 1 a i a a a m m m m u m m m- 

Steps may be taken forwards (+) or backwards (— ). The 
mean velocity (observed over 10's to 100's ) of steps, V, 
is a function of the load F = (F x ,F y ,F z ) exerted on 
the motor and the fuel concentration [ATP] (for most 
cases) [HI, EH Hi! ancl > m general, of other features of the 
aqueous solution including the pH, ionic strength, tem- 
perature T, and other reagents/reactants such as [ADP], 
[Pi], [AMP-PNP], [BeF 2 ], etc.)[H, @, M 



Motor proteins are enzymatic catalysts that, following 
biochemical knowledge and principles, turn over one "fuel 
molecule" (usually ATP) for each full step via (in the 
simplest case) a linear sequence of reversible kinetic tran- 
sitions (or reactions) embodying N (bio)chemical states 
per turnover [H [H, H3, HH, |23, [gl, [2J . This situation is 
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which is understood to repeat periodically as the motor 
moves processively along its track. The subscript I — 
1, 2, • ■ • labeling the TV basic states [0], (1), •• • (JV - 1), 
denotes the sites on the linear track spaced at distance d 
apart. 

By convention the state (i) — [0] is "bound" or 
"nucleotide- free" so that the transition [0] — > (1) rep- 
resents the binding of one fuel molecule to the awaiting 
motor. Thus we write 



uo = *o[ ATP L 



(2) 



where the pseudo-first-order rate constant fco and all the 
remaining rate constants itj, Wj depend also on F. But 
under fixed conditions (F, [ATP], • • • ), the rates do not 
change. Note that this formulation embodies the tight 
coupling principle of one fuel molecule being consumed 
per (forward) step [E QJ, [l^ . l2~ij . This is assumed in the 
basic model, which also neglects irreversible detachments 
from the track (wh i ch, however, can be included readily 
in principle [H Ml, i| ) . 

When convenient we will allow the state labels i, j, 
to take values outside the basic range [0, N — 1]; for that 
reason we adopt the periodicity convention 
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U i+ lN = Ui, Wj+lN=1Vj, I = 0,±1,±2, ••• . (3) 

Now, in the simplest experimental situation, as ob- 
served for kinesin, no mechanical substeps are de- 
tected [El, [I3J] to within the noise level (which amounts 
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to Ax < 1 run) . Furthermore to within the resolu- 
tion time (< 100 /is), successive steps occur at times, 
say, ■ • ■ , tk-i, tk, ifc+i, • ■ • ■ Thus, between the identi- 
fiable mechanical steps of (mean) magnitude d, the mo- 
tor dwells in a mechanical state that, within the noise 
level Ax, appears well defined with no systematically de- 
tectable substeps, forwards or backwards. Then, individ- 
ual dwell times in the mechanical states, namely, 

r fe =4-4_i, (4) 

can be measured to reasonable precision and averages 
may be computed, over "many" observations encompass- 
ing, say, n steps, to yield an overall mean dwell time 

1 ™ 

k=l 

Here and below we use the "asymptotically equals" sym- 
bol w to indicate an approximate equality that becomes 
exact in a long run under steady-state conditions. 

Given a (sufficiently long) sequence of n observed steps 
with n + forward steps and n_ backward steps, we can 
also define the (steady-state) step splitting probabilities 
or back-step and forward-step fractions 

7T + w n+/n, 7r_ w n_/n, (6) 

where, since n = n + + n_, one has 

7T + +7T_ = 1. (7) 

Furthermore, dwell times before a + or — step can 
be (and have been |13] ) measured separately leading to 
distinct prior dwell times 

t+ = — J2 +T k and t_ = — X^T fc , (8) 
n + rt- 
the restricted sums including just + or — steps, respec- 
tively. 

To the degree that the runs are long so that 7r + and 7r_ 
may be accurately considered as probabilities one must 
evidently also have 

7T + T + + 7T_T_ = T. (9) 

As discussed recently in some detail [2(| [H[ , each in- 
dividual biochemical state («), may be characterized by 
a definite (mean) longitudinal location in physical space, 
i.e., along the track, which we supposed aligned with the 
x coordinate, and, possibly, transverse to the track, the 
y-coordinate, or normal to the track, the z-coordinate. 
Hence the basic model implies the existence of substeps, 
say, of magnitude 

dj = x j+1 - Xj , (10) 

between successive mechanochemical states [25|. How- 
ever, the great majority of these mechanical displace- 
ments will be hidden by noise and so unobservable. This 
is the crucial issue. 



The evidence (in particular for kinesin 0, [l3[ ) reveals 
the existence of one principal or major mechanical 
substep of magnitude 

d-M = x M +i - %M ^ d, (11) 

that corresponds to a specific transition (M)— >(M+1) for 
a forward or + step. Such a unique forward-step is some- 
times called a "power stroke" . Then, clearly, within the 
basic model a back-step (— ) corresponds to the specific 
transition (M + 1)— >-(M). 

For simplicity we will initially suppose that there is 
only one such single, well defined and observable princi- 
pal mechanical transition in the processive reaction cy- 
cle: it will be referred to as a major transition while 
all other smaller, unobservable displacements, presumed 
"hidden," will be termed substeps. 

It is of interest, all the same, to analyze situations in 
which, within the full cycle, there are a number of visible 
(or observable) substeps. Indeed, an initial substep large 
enough to be readily observable was predicted for myosin 
V by Kolomeisky and Fisher [l|| on the basis of dwell- 
time data obtained at different [ATP] and force levels |3f ; 
it was later observed unambiguously by Uemura et al [1 01 ] . 
Thus, in SecfVlbelow. the case of K (< N) distinct major 
substeps is considered explicitly (26j . 

Nevertheless, since most of the forward and reverse 
transitions, (i) — » (i — 1) and (i — 1) — > (i), are not 
observable, one does not know (and cannot tell) the 
(bio)chemical (sub)state of the motor during an observed 
dwell time, t^, between steps k— 1 and k: see @. Indeed, 
the biochemical state will change as time progresses and 
not necessarily in a uniform sense, e.g., ATP might bind 
and then be released (or unbind) without undergoing a 
hydrolysis step. For this crucial reason once the basic 
model has N — 2 or more states the expressions for r+ 
and r_ in terms of the basic rates Ui and Wj cannot be 
trivial — and the same goes for the splitting or backwards 
and forwards probabilities 7r + and 7r_ . 

The basic theoretical problem is thus to find explicit 
expressions for 7r + , 7r_, and for the partial dwell times 
as well as for conditional or pairwise stepping fractions, 

7r ++ , 7r |_, • ■ • , and dwell times, t ++ , t |_, ■ ■ • , that it is 

natural to introduce (as seen below). Indeed, it is clear 
that these general statistical concepts are not restricted 
to linear or translocational motors, on which we have 
focussed; in fact, they apply equally to rotary stepping 
motors like Fi-ATPase [ll^llaloLFiFo-ATPase [U, 
HH , and bacterial flagellar motors [2|.|33|| . However, since, 
in these respects, kinesin and myosin V have been studied 
more extensively, we will retain the language appropriate 
for processive motor proteins walkin g on linear tracks. 

In previous theoretical studies [191I341. [3f| H(| splitting 
probabilities and conditional mean dwell times have been 
introduced in the context of molecular motors via the 
following definitions (which are distinguished from those 
used in the discussion above by haceks) specifically: 7r + = 
•n^Ar and 7r_ = itq.-n are the probabilities that a motor 
starting at a well defined physical site / along the track 
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in the binding state (j) = [0] will arrive at the next site 
I + 1 or back at the previous site I — 1 (in both cases in 
state (j) = [0]) without having undergone the opposite 
transition to complete a full backward or forward cycle, 
respectively. Then, similarly, f± = tq 7 ±n are the average 
times a motor spends at site I (starting at j = 0) before 
completing a full forward or backward cycle to site I ± 1 
(in state (j) — [0]). Exact results for such statistics can 
be derived by mapping into a Markov renewal process [HI, 

n 

With these definitions the explicit formulae ob- 
tained [HI H|| for 7r± and f± correspond to a full 
mechanochemical cycle during which a complete forward 
or backward step is certainly taken. However, the result- 
ing expressions can be applied to the analysis of experi- 
mental stepping data only if these data allow one to iden- 
tify each full cycle. If, instead, the observed noise hides 
one or more of the N > 1 biochemical or mechanochemi- 
cal substeps, while only major mechanical transitions are 
detectible, one cannot in general decide unambiguously 
whether a motor executed the detectable steps (or power 
strokes) with or without completing a full cycle. In such 
cases, the previous expressions cannot be applied to ac- 
count for the observed step fractions and dwell times tt± 
and t± . Instead, the results must be modified to allow for 
the ambiguity arising from the hidden substeps. It tran- 
spires, as we show below, that this rather subtle and at 
first-sight inconsequential small difference actually leads 
to significant changes in the load dependence especially 
(but not exclusively) when the fractions of back-steps and 
forward-steps are similar in frequency, i.e., on approach- 
ing stall conditions when the velocity, V, becomes small 
relative to its load- free value [la, Oil 12CJ] • 

To clarify the issues involved, suppose the cycle has 
four states (N = 4) and the major transition occurs be- 
tween states (2)/ and (3)/ (i.e., M = 2). Then in stepping 
time series such as illustrated in Figs. 1(a) and (b), one 
can identify all the moments of time at which the motor 
leaves state (2); and reaches state (3); on moving for- 
wards or when it leaves state (3)/ for state (2)/ on moving 
backwards. When successive forward steps are realized, 
as illustrated in Fig. 1(a), one knows that the motor must 
pass through the remaining two states, [0] and (1), at 
some points between the major transitions [see Fig. 1(a)] 
because state (2) cannot otherwise be reached following 
state (3) at the same site I. Thus in a sequence of three 
successive + steps one can conclude that the middle step 
is associated with a complete (forward) cycle. The cor- 
responding observations are equally true for successive 
back steps. On the other hand, when the overall step- 
ping sequence encompasses both back-steps and forward- 
steps, which is the interesting (and usual) situation [see 
Fig. 1(b)], it is impossible, for example, to be sure that 
the motor has completed a full forward cycle when a (de- 
tectable) forward step is followed by a back step; likewise, 
one cannot tell if a full back cycle was completed. In such 
cases, the full-cycle assumption is not valid. 

The full-cycle assumption can be inadequate even 



when only a run of forward steps is seen, as in Fig. 1(a), in 
that back steps may merely be infrequent. This is some- 
what counterintuitive since one might well argue that 
each + step does then correspond to a full cycle. How- 
ever, if the enzymatic cycle is reversible there is always 
the possibility of a completed back step; thus explicit ex- 
pressions in terms of the basic rates {uj, uij} will differ 
when potentially hidden substeps are allowed for. Never- 
theless, as we demonstrate in Sec. |lVl there are various 
cases in which the quantitative differences may be small. 

To demonstrate further the consequences of different 
conceivable interpretations, consider an (N = 3)-state 
motor with two possible substeps. Fig. 1(c) illustrates 
a stepping series with a relatively high noise level so 
that only the major transitions, say (1); ^ (2)/ for 
M = 1, at times tk (with the corresponding dwell times 
T~k = tk — tfc-i) can be measured. On the other hand, 
Fig. 1(d) shows exactly the same series of steps, but with 
a much lower noise level revealing the previously obscured 
small substeps, [0]; ^ (1); and (2); ^ [0]/+i- In the lat- 
ter case, one can determine the times t£ when the motor 
reaches the bound state [0]; for the first time (i.e., when a 
cycle is completed). And then one can reliably determine 
the number h + of full forward, and n_ of full backward 
cycles. In general, when both forward- and back-steps are 
present the mean values of the cycle times f \ = t® — t { (_ 1 
(and so f, f+, and f_) are quite different from the mean 
step-to-step dwell times r, r + and r_ , that one can obtain 
from the noisy stepping series in Fig. 1(c). The difference 
between the splitting probabilities, tt± and n±, is even 
more obvious. For example in Fig. 1(d) one has only one 
back cycle since one must not consider the major transi- 
tions at times £3 and £4 as indicating full stepping cycles 
because the motor never actually reached the next bound 
state [0]: hence from this sequence one should estimate 
7r + ~ 5/6 and tt_ ~ 1/6 and 7r+/7r_ ~ 5. Conversely 
in Fig. 1(c) one would count 6 forward and 2 back steps 
(or to be more precise, major transitions) leading to the 
estimates ir + ~ 3/4 and 7r_ ~ 1/4 so that ir + /ir- ~ 3. 

From a mathematical viewpoint, although most of the 
transitions and biomechanochemical states remain un- 
seen, there is one bright spot! Specifically, in light of the 
basic feature or model assumption embodied in Eq. |T]), 
at the instant of time before the moment, say tk, at which 
a + step occurs, one can be sure the motor was in state 
(M) while in the instant just after tk the motor is in 
state (AT + 1); and, likewise, just before a backward (— ) 
step the state (M + 1) is occupied, while just after a 
back-step the state (M) is definitely occupied. Together 
with the standard Markovian premise of chemical kinet- 
ics, namely, that once in a well defined chemical state 
the subsequent departures are independent of the mode 
of arrival, this crucial observation enables the system- 
atic calculation of splitting probabilities and conditional 
dwell times for general N > 1 via the Theory of First 
Passage Times: specifically, as we now explain, we may 
use the analysis as formulated by van Kampen [37j ] . 
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FIG. 1: Schematic graphs illustrating why the full-cycle interpretation is not adequate for describing stepping data: Plot (a) 
depicts a hypothetical time series of forward steps for an (N = 4) cycle with M = 2. The (0,l)'s indicate that the states [0]; 
and (1); are both certain to occur at least once between the pairs of states (3);_i and (2);. (See the text.) Then (b) represents 
a similar time series but with one major back-step. The question-marks between the forward-back and back-forward steps 
indicate that one does not know if the motor has ever passed through the states [0]; and/or (1); in these intervals. Finally, 
(c) and (d) depict two equivalent time series for a motor with TV = 3 and M = 1 but with different noise levels. The first 
plot allows one to identify only major forward and backward transitions at times tk (marked on the axis) while the states [0]; 
and corresponding substeps are hidden in the noise. However, the second plot reveals all the transitions and substeps, so that 
the t%, marking the beginning (or end) of each full cycle, can be determined. These schematic examples demonstrate that 
detectable transitions do not necessarily correspond to a full biochemical cycle so that a proper statistical analysis must take 
account of substeps hidden in the noise. Notice, indeed, that the three major transitions identified as forward and back steps 
at times £3, tt and £5 (on the left) are associated with only a single complete cycle from t° to £3. 



II. CONDITIONAL SPLITTING 
PROBABILITIES AND DWELL TIMES 

Before undertaking explicit calculations to obtain ex- 
pressions for 7r + , 7r_, and r + , r_, in terms of the Ui and 
Wj for general N and M, we introduce some further sta- 
tistical properties that are straightforward to observe ex- 
perimentally and might prove mechanistically informa- 
tive. At the same time, they enter naturally in to the 
first-passage analysis that is presented in Sec. IIII1 

In addition to the prior dwell times defined in © one 
may separately observe post dwell times by measuring in- 
tervals following after + or — major steps: we will label 
the corresponding mean dwell times r +0 and r_ , where 
the subscript o is read as 'diamond' and denotes, here 
and below, a + or a — step. However, such dwell times 
may be truncated by detachments (or dissociations or dis- 
connections) in which the motor leaves the track (essen- 
tially irreversibly) so ending a run. The rates of detach- 
ment from states (i), say Si, can certainly be included 
in the basic sequential kinetic model [HI, [U, ; but in 
the first instance they may be neglected provided, as we 



will suppose, only time intervals between observed + or 
— mechanical steps are considered. (Their effects, how- 
ever, would be significant if dwell times prior to detach- 
ments or immediately following attachments were consid- 
ered which might, indeed, prove informative.) 

Neglecting such "initial" and "final" dwell times (al- 
though the former have been examined by Veigel et 
al. for myosin V in seeking observable mechanical sub- 
steps 38]) one may still observe the four distinct condi- 
tional mean dwell times: 

t ++ : between two successive forward (+) steps, 

r_| : between a + step followed by a back-step, 

r |_ : between a back-step followed by a + step, ^ ' 

t : between two successive back-steps, 

defined, as in ©, in terms of the observed intervals 
= — t ( j^} 1 , averaged over n ++ pairs of successive 

+ steps, and likewise for n_| pairs of + steps followed 

by — steps, etc. 

Another aspect is to note that for realistic runs of lim- 
ited length, deviations of order 1/n will arise. Thus, for 
example, for a run of length n = n + + n_ starting with 
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a + step the overall mean dwell time is given by 
T = [(n+ - l)r+ + n_T_]/(n - 1), 



(13) 



there being only (n — 1) measurable (prior) dwell times 
T; = ti — ti-i. Using the definitions © then yields 



7T + T + + 7T_T_ 



tt_(t + - r_) 



1 



(14) 



In fitting asymptotic (n 3> 1) expressions to real data 
from finite runs such systematic deviations should be rec- 
ognized. Here, however, we will neglect such finite-n or 
end-effects. 

To proceed further it is also helpful to introduce the 

pairwise step probabilities 7r ++ and 7r_| defined as the 

probability that a + step is followed by a + or, respec- 
tively, by a — step, and likewise, ir |_ and ir These 

then satisfy 



= 1 



and 



7T_ + 



= 1. 



(15) 



Again, in a finite run of n steps one can divide the 
n — 1 successive pairs into n ++ of + steps followed by a 

+ step, and so on, and use 7r ++ » n ++ / {n ++ + ), 

7T_| w n_| / (ri-\ +n ++ ), etc. Noting that in a given run 

one must have \n-\ — n |_| < 1, and neglecting finite-n 

corrections, leads to the valuable relation 



7T+7T+_ = 7T_7r i_. 

From this follows the connections 

7T+ = 1 - 7T_ = 7T_ + /(7T + _ + 7T_ + ), 



= 1 



7T+ 



7T+_ 
7T_ + 



and 



= 1 



(16) 



(17) 



(18) 



Together with (fT"5|) these relations show that the pair 

7T_) and 7r |_ or, equivalently, 7r++ and it serve to 

determine all the back/forward or splitting probabilities. 

It is worthwhile to carry these considerations a stage 
further by recognizing the Markovian character of the ba- 
sic iV-state model ([I). Thus, neglecting detachments, the 

four division or splitting probabilities 7r++ , 7r^ , n |_, 

and 7T_- satisfying (fT5|) can be regarded as the elements 
of a 2 x 2 stepping matrix, [7T 0j g], that stochastically de- 
termines the transitions from one (major) step, + or — , 
to the next. By virtue of the conservation of probability, 
the largest eigenvalue is A = 1; but the second eigen- 
value, which determines the decay per step of step-step 
correlations, is just 



A, 



1 - 



1. 



(19) 



This vanishes when tt^ = n |_ = i which corresponds 

to 7r + = 7r_ and hence, to stall conditions in which the 
mean velocity, V, vanishes. 



Counting arguments similar to those yielding (fl3)) - 
(Unj) also lead to relations for the conditional mean dwell 
times. For completeness and consistency with later ex- 
pressions, we utilize the +/— ' 'diamond? notation intro- 
duced before. For the prior dwell times, we thus find 



T + = T 0+ = 7T + + T ++ 
T_ = T _ = 7T- + T+- 



«+-'-+) 
7T T , 



(20) 
(21) 



which, in turn, are fully consistent with the relation (J5|) 
for r in terms of r+ and r_ where we should note 



7r +0 and 7r_ 



7T + = 7T« 

Then the post dwell times likewise satisfy 

T+O = 7T++7++ + 7T + _T + _, 
T_o = 7T_ + T_ + + 7T T , 

while the overall mean dwell time is given by 

T = T 00 = 7T + T +0 + 7T_T_ . 



(22) 



(23) 
(24) 



(25) 



Each of these pairwise fractions and dwell times can 
be obtained from the same experimental data (i.e., step- 
ping time series) that have been used experimentally to 
obtain the step splitting probabilities and the prior dwell 
times in the course of studying the dynamics of a motor 
as a function of load and [ATP], etc. But by observing 
such further independent statistical parameters one can 
test the basic theory more completely and hope to obtain 
more reliable and constrained fitting values for the rates 
determining the full mechanochemical cycle. 

At a more detailed level it is also useful to define n p °, 

or — , as the mean number 



and 



with 



o, 



"3,— P' a 

of hidden forward and backward transitions, possibly hid- 
den, from states (i) and (J), respectively, in the intervals 
between (major) steps subject to the conditions specified 
by the pair (p, a) . If these transitions prove to be de- 
tectable, they can be counted and used in fitting param- 
eters; but if they pertain to hidden transitions (e.g., the 
hydrolysis of ATP, etc.), it is of interest to estimate how 
often they occur given specific rates. The appropriate 
calculations on the basis of the model {TJ) are developed 
below in Sec. ImEl 

It is appropriate here to mention various hidden- 
Markov methods, etc. [H HO, EL El; that have been 
derived and employed to locate steps in the presence 
of noise (and to fit their amplitudes, or kinetic parame- 
ters, etcA. These approaches require an input stochastic 
model [H, IH, [43[ ; we believe the present approach could 
provide a valuable complement in the extraction of ki- 
netic parameters from such experimental data since, as 
we will see, it reveals the kinds of behavior different mod- 
els can generate. 
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III. EXPLICIT CALCULATIONS 

A. Formulation and Notation 

The various stepping fractions, dwell times, etc., intro- 
duced in Sec. HI and iHl can be derived explicitly in terms 
of the basic kinetic rates by using van Kampen's analysis 
for onc-dimcnsional, nearest-neighbor first-passage pro- 
cesses 37]. Accordingly, following the basic sequential 
model , with the iV-periodicity conventions ((3]) for the 
sequential forward and backward rates, Ui and Wj, we 
envisage a random walker on a one-dimensional lattice 
with sites labeled i,j — 0, ±1, ±2, • • ■ , corresponding, in 
turn, to the motor states (i), (J), etc. (again subject 
to the periodicity convention). If the single major or 
observable step per cycle corresponds to the transitions 
(M) ^ (M+ 1) with M S [0, iV- 1] we introduce (follow- 
ing (37J) absorbing boundaries on the left and the right 
via 

L = M and R = M + 1 + N. (26) 

If, for given initial conditions at time t = (to be 
selected below), <ft(t) is the probability that the mo- 
tor/walker is in state (i) at time t we may construct the 
NxN transition matrix A = [Aij] with elements 

Aij = Uj5 i: j + i + Wj8i + i t j - (uj + Wj)5ij, (27) 

where i,j G [L + 1,R- 1] = [M + 1,M + N}. Then 
if q T = [qM+i>QM+2, ' ■ • > Qm+n] is the state vector, the 
governing rate equations are 



dq(t) 
dt 



Aq(t), 



(28) 



This completes the first-passage formulation [33]. Be- 
fore proceeding, however, we record some convenient no- 
tation for the various products and sums of the rate con- 
stants that enter the analysis. To that end, our first 
definition [44[ is of the (m^l)-term product 



r, m = n ^ 



(29) 



which, by periodicity, is invariant under I =4- l±N. Like- 
wise, the N-tevm product T^n is independent of I yield- 
ing, specifically, 



JV-l 



— Tut — TT — , 



(30) 



[4J]. Then for all I = 0, ±1, ±2, ... a central role will be 
played by the (n — l)-term sum 



(31) 



where, for the empty sum, we set A^i = 0. Indeed, 
these sums appear in previous analyses [H, [H, Q 
via "renormalized" inverse forward rates (or transition 
times) 

ri - u;\l + Ai, N ). (32) 
Specifically, these enter into the expression [3 [2l|, 

^/d = (i-r^)/E^o 1 n (33) 

for the mean velocity V, which we recall here for con- 
venience of reference. (Note that d is the mean spac- 
ing of sites along the track.) One sees directly from 
this that stall conditions, i.e., V — 0, are determined 
by TN(ui,Wj) = 1. The situation near stall will be a 
major focus for our discussions in Sec. IIVI 

The analysis of van Kampen [37| may now be put to 
work. Readers uninterested in the details may skip to the 
next section or peruse the main results, namely, (|37p - (|4Tj) 
for 7T++, etc., ([53 |) - (|5?]) for r ++ , etc., and l[HD )) - ([62|l for 
r + , r_ and r. 

B. Pairwise step splitting probabilities 

To proceed, let irj^ be the total probability that a motor 
starting at t = in state (k) with L < k < R, so that 
<7i(0) = Sik, eventually reaches the left absorbing state 
(L) for the first time, i.e., without having been absorbed 
at sites L or R. A moment's thought confirms 



(34) 



71 'k = w l+i \ qL+i(t)dt; 



likewise, for reaching the boundary state (R) for the first 
time, one has 



Ur-X 



q R -i(t)dt. 



(35) 



Then we may appeal to Ref. [13, Ch.XII, Eq.(2.8) which, 
using the notation (|2"9")l . states 



l^ m =k-L 1 L 



1 



(36) 



For L and R we have (|26|) . If the motor starts just 
after a (major) forward or + step it is in the initial state 
(k) = (M+ 1); then n% and ttJ} correspond, respectively, 

to 7r^ and 7r ++ . Conversely, just after a (major) back or 

— step the motor is in state (M) which, by periodicity is 
equivalent to starting k = M + N; then we may identify 

7r^ and wj} with tt and n |_, respectively. On using 

the notation (l30l) and (13T1) we thus obtain 



n ++ (M,N) = l-7T+_ 

7T- + (M,N) = 1 - 7T__ 



1 



1 + + Am,at 
1 + Am, at 



1 + Tjy + A 



, (37) 
(38) 
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It is interesting to note the cross-relation 

7T = T N TT ++ . 



(39) 



7r |- which 



At stall this implies 7r ++ = tt and 

reflects the expected +/— symmetry. The results (|37|) 
and (f38|) also yield, via the identities (fT7|) and (fl8|) . the 
explicit step fraction expressions 



1 



7T+ 



TV 



1 + Tat 

Am at 



2A 



M.N 



i + r 



A 



2A 



A/. AT 



(40) 
(41) 



C. Pairwise and prior dwell times 



Now, following van Kampen [37j, Ch.XII, Eqs.(1.7)- 
(1.8), the conditional mean first-passage times for arriv- 
ing either at the left or right absorbing boundaries start- 
ing from a state point k E [L + 1,R-1] = [M +1.M + N] 
as before, are 



f™tq L+1 (t)dt 
jo°° QL+i(t)dt '' 



tq R ^ 1 (t)dt 
f™qR-i(t)dt ' 



(42) 



where the qi{t) are the solutions of (|28p subject, for our 
purposes, to the two, alternative initial conditions, 



«i + (Q) 



,M 



+i and g i (0) = S^m+n, (43) 



as discussed in deriving (|37|) and (|38jl . 

To obtain the pairwise conditional mean dwell times we 
proceed in two steps. First, we integrate the kinetic equa- 
tions (|28|) over all times recognizing that the qj{t) ap- 
proach zero exponentially fast for all j G [M + 1, M + N] , 
since the walker must eventually be absorbed at either L 
or R. This yields 



q ± (Q)=AT ± , 



(44) 



where the superscripts identify the alternative initial con- 
ditions (l43l). The elements of the vector have the 



dimensions of time and are given simply by 

q±(t)dt. (45) 
with (j2"6"|) , we also have 



By the definitions (j3"4"| and 
the relations 



± M+l 



7T+_/uJm+1, 



M+l 



Tm+ iV — 7r + + / + jV , J- m+ JV 



= tt — /w M+ i, (46) 

= 7T- + /um+N- (47) 



Since A is a tridiagonal matrix the equations (|44[) can 
be inverted recursively to obtain 



t; 



^++ rr-F — > ( 48 ) 



1 + Amj-m 

"j^M^'-M 



(49) 



One general approach to this inversion can be found in 
Ref. [37]], Ch.XII, Sec. 2. On recalling A ; i = 0, one may 
check that these solutions verify the relations (|46| and 
63). 

The next step is to multiply (|28[) by t and again inte- 
grate over all time which yields 



AS'- 



(50) 



where, with the same superscript conventions, etc., we 
have 



tq±{t)dt. 



(51) 



From (|42f and following the arguments above (|3T[) and 
|), we can now make the identifications 



,L + 

k 

R— 

k 



— T +- — S M+1 /T M+1 , 
m+n/Tm+N' 



(52) 
(53) 



and similarly for r and r ++ . Inverting (|44p finally 

leads to the basic pairwise dwell time expressions 



M+N 

£ T+(1 + A M ^ M ), 

j=M+l 

M+N . 

E t + tiMj-M 

i 1 + A fl 

j=M+l 



T+- 



M+N 

E t- s 

j=M+l 



N 



Am,jv 



W,j-Af 



M+N 

E 25 

J=A/+1 



rjy + Am,n 

- + Am,w — 



r 



A" 



(54) 
(55) 
(56) 
(57) 



By using ([37| and ([38| for tt++ and 7r__ in (|48j) and 
49|) we may establish the unanticipated, general equality 



all M, N. 



(58) 



En route to the prior dwell times r+ and t_ it is con- 
venient to introduce 



7r_r; 



(1 + Aj, jy)A 
I+Tat + 2 A 



M,N 



(59) 



where we have used (|40l) and l|4"Tj) and may recall that the 
numerator is the inverse rate defined in (|32p and used in 
the past. Note also that by virtue of the periodicity we 
have Tj±N = Tj and, likewise, for the Tj . Then, by 
utilizing ([20|). ([jl"jl. ([37]) . and (J38J we obtain the prior 
dwell times in the form 



M+N 



T+ = T 0+ 



1 \ „ 1 + Am.j-m 

7T+ ^ J l 

_^ M+N 

V E 25 



i=Af+l 



r^v + Am,n ' 

+ Am,at — Amj-a/ 
1 + IV + Am,at 



(60) 
(61) 
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Finally, with the aid of the overall mean dwell time 
is simply 



M+N 



N-l 



E T i = E T >- 



(62) 



j=M+l 



i=0 



Although, we have introduced the various step frac- 
tions via the pairwise fractions 7r ++ , etc., this was not 
a necessary move from the mathematical point of view. 
Indeed, one can the results (|4*0")) and (jlTj) for ir + and 7r_ , 
and the present expressions for r + , r_ and t, directly 
by solving the basic rate equations (|28[) with the initial 
conditions 

q k (0) = 7T + 4^ /+1 + (1 - 7T + )4 3 , /+A r, (63) 

together with the relations (f44|) and (|50|) and appropriate 
boundary conditions. By this route one need not men- 
tion the pairwise splitting probabilities or pairwise dwell 
times. Nevertheless, the pairwise stepping fractions and 
dwell times can be useful in data analysis and to test 
theory, since they represent additional force and [ATP] 
dependent parameters that can be measured without sig- 
nificant extra experimental effort. 



D. Individual and post dwell times 



Now the form (|62|) for r strongly suggests that Tj = 
T?° is actually the overall individual mean dwell time 
spent in state (j) irrespective, as indicated by the use 
of the +/— or diamond notation [see (|20|) - (j25|) ] of the 
stepping sequence. This conclusion is, indeed, justified 
since it follows from (|45|) that we may identify = T^ +0 
and TJ = TJ as individual post + and — step mean 
dwell times in state (j), respectively. Consequently, the 
mean overall post dwell times, are given by 



N-l 

E 



and 



N-l 

E 



t; 



(64) 



From these results one may verify that (|25|) is satisfied. 
Likewise, we anticipate relations like 



N-l 
r ++ = E 

4=0 



(65) 



etc., and, hence, from (|54[) and (|57) we surmise that the 
conditional individual state dwell times are 



rp + + _ rp — 

3 3 



(1 + A M >j _ M )(r JV + A M>N - A 



M,j-M) 



u j Fm,3-M (1 + + Am,n) 

while ((56]) and ((55]) yield 

rjv(l + Amj-m) 2 



3 UjTmj-m^ + Am,w)(1 + Tat + A m ,n) 



(Tn + Am,n — Amj-m) 



3 UjT M ,j-M(^N + Am,tv)(1 + Tn + Am.tv) 



, (66) 

(67) 
(68) 



In terms of these we can define the prior individual (or 
partial) dwell times via 



n + = 7T++T+- 



+ Am,j-m) 



(1 + A m ,jv)(1 + Tjv + A m ,jv) ' 



(69) 



rpO- 
j 



-TJ 



(T N + A m ,at)(1 + T N + A m ,n) ' 



(70) 



where the r 3 - are defined in (|3^|) . Hence one can check 
the expressions for the mean overall prior dwell times, 
t 0+ and t _, given in (|6"0|) and (|6T|) . 

Evidently, the analysis presented does not fully justify 
the inferences regarding ([66 |) - ([70|) . However, these ex- 
pressions have been checked by direct computation for 
N = 2 (as recorded in Appendix A) and the various 
cross-checks for general M and N also serve as valida- 
tion. However, a complete justification requires a more 
elaborate calculation that we hope to present in the fu- 
ture. By the same route one can derive the mean condi- 
tional counts, n p ° + and n p °_ [see after Eqs. (|2"3")l - (|2"5j) ] of 
the hidden substeps as we now proceed to demonstrate. 
Corresponding results for N — 2 are also presented in 
Appendix A. 



E. Counting the hidden substeps 

As touched on briefly in the penultimate paragraph of 
Secini it is surely of interest in light of our basic premise 
to estimate for a particular model how many hidden sub- 
steps actually arise on average in the typical intervals 
r = t 00 , t +0 , r ++ , etc., between specified successive ob- 
servable steps, i.e., major transitions between states (M) 
and (M + 1). Granted the results obtained in the previ- 
ous subsection for the T P<J , where henceforth, (p, a) runs 
through the nine combinations 



{pM = { 



oo; +o, — o, o-K o— ; 



-}, (71) 



this is a reasonably straightforward exercise. 

Notice, first, that the mean dwell time in a given state 
(j), say fj , is, by virtue of the Markovian character of the 
relevant biochemical reactions independent of whether 
the state was reached from state (j — 1) or (j + 1) and 
of whether the motor departs to states (j + 1) or (j — 1): 
formally, we may write 



{Uj 



i 3 



•(-) 



,(+)-(+) 



,(+)-(-) 



(72) 



where the superscripts have an obvious interpretation. 

,_i are the number of transi- 



Then, if n, 



+j+i and rij- 



tions, i.e., substeps, forwards or backwards, respectively, 
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from state (J) in an interval between observable steps we 
desire the conditional moan values 

= (nj^j+i) p<7 and n %_ = (n,wi-i) pCT . (73) 

In terms of these we will have for the overall mean num- 
bers of + or — hidden transitions per step-to-step interval 



1 



M+JV-1 



j=M+l 



M+N 



N 



(74) 
(75) 



i=M+2 



for all nine pairs (p,o~). As a moments thought reveals, 
the limits on the summations here must be carefully set: 
thus, after a forward step to state (M + 1) any puta- 
tive forward hidden substep transformation from state 
(M + N) would represent a full observable + step (i.e., a 
major transition) and so is to be excluded from the sum of 
substeps; equally, any back transition from state (M + 1) 
would represent an observable (back) step whereas back 
sw&steps from state (M + N) , prior to the final forward 
step, are to be counted: See, for example, Fig. [3] which 
can be regarded as a noise-free version of Fig. Hfd) [but 
for an (N = 4)-state model]. 

Now consider Tj° , the mean time spent during a 
(p, a) step interval in a state (j) that is neither an 
initial nor a final state of a major transition, so that 
j ^ M, AI + 1. On average this state will be visited on 
T? a /fj — {uj + w^Tj" 7 separate occasions during the 
interval; thus it entails n^ a + forward substep depart 
and 



back substep departures. Equally, it entails, 



on average, n? a _ 1 + forward substep arrivals and Tij +1 _ 



back substep arrivals. As a result we have the frequency 
relations 



UJ 



pa 



(u>j + Wj)T: 



pa 



n 



pa 



n 



pa 



for j e [M + 2, M + N - 1] and, similarly, 



per per 

w 3 =T *i-i, 



7? 



'3+1,-' 



(76) 



(77) 



where the units of ujj may be regarded as substeps per 
interval. 

For the reasons explained after (|75p and illustrated in 
Fig. [2] the 'boundary states' j = M + 1 and j = M + N, 
require special consideration. As noted, one cannot have 
STi&steps that go backwards from state (M + 1) and, in 
the case of a prior forward step, the certain incoming 
arrival must be included in the individual mean dwell 
time. Thus we are led to the four boundary frequency 
relations 



Complementary arguments apply for the opposite bound- 
ary state (j) = (M + N), yielding 



UJ 



M+N 



= n 



M+N,- 



1 = n 



M+N-\,+ i 



(82) 



U M+N ~ n M+N ,— ~ n M+iV-l,+ > (83) 



M+N,- 



M+N 



II 



M+N, 



= n 



M+N-1.+ 

1. 



M+JV-1, + 



(84) 
(85) 



The frequency relations ([76"| and ([77)1 with the bound- 
ary relations (I78p - (l85|) constitute, together with the def- 
inition — (v,j + Wj)T^ a in (|76[) . a complete set from 
which we may derive explicit general expressions for all 
the and _. To proceed, we note first that the 
purely counting arguments involving 7r + , 7r_, and tt++, 
7r__, etc., that led to the 'reduced' relations ()20l2ip 
and (|23I24[) . apply equally to uJj~ + , ujj , etc. Accord- 
ingly, we obtain the post boundary frequency relations 

(86) 
(87) 
(88) 
(89) 



, 1+° 

^M+l 


- 77+° 


-2,- + 


1 =</+!,+ +7T+-, 


W M+N 


- 77 +° 

— ' l M-\ 


-JV-1.H 


- = n ~M+N- +1"++, 


—o 
U M+1 


— n M4 


-2,- = 


n M+l,+ i 


— o 
^M+N 


— n M4 


-JV-1.H 


- + 1 = n M+N- + n - 



while the prior relations are 



(90) 



W M+/V — 71 M+N,- + 1 — fl Af+JV-l,+ + ^ — ' (^1) 



^Af+1 



n M+l,+ + 1 — ra 



M+2,- 



(92) 
(93) 



Finally, in analogy to (f25|) , we obtain 



OO OO i OO i ( c\ A \ 

W M+1 = n M+2,- + 7T+ = ™M+1,+ + (94) 

OO OO i OO i /nr\ 

%+« = «M+/V-1 : + + T- = n M+N,- + 7T+- (95) 

Now clearly implies the result 



7) 



A/+1, 



A/ 



+1 - 7T_ = (Mj + Wj)T M+1 - 7T_ 



(96) 



where 7r_ and T$ +1 = T^ +1 ({uj,Wj}; M, N) are given 
explicitly in (|4l|) and (|59| . A similar result follows 
from (f94l) for n^| +2 _ . But by appealing to (|77f for 
j = Af + 2 we also obtain 



'M+3,- 



y A-f+2 



l M+l, + ! 



(97) 



which, with the aid of (|96|) . yields an explicit expression 
in terms of Tm+i and Tm+2- Substituting j = M + 3 
in ([75)1 then leads to an expression for n^ +3 , . By pro- 
ceeding recursively in this fashion we eventually obtain, 
for k = 1, 2, • • • , the general expressions 



A/+2fc-l 



,++ 

Af+1 


- 77++ 


- 77+ 4 


f2,-+^ 


(78) 


OO 

n M+2fe,- — 


'm+i 


= n M+l,+ 


+ 1 = 


Tl Af+2,- + 1' 


(79) 


3= 


'm+i 


= n A-f+l,+ 


= n M 


f2,-> 


(80) 


OO 

n Af+2fe+l,- — 


'm+i 


= n M+l,+ 


+ 1 = 


n M+2,-- 


(81) 





l ( Mj+U ; J )r; <> -7r + , (98) 



Af+2fc 

n j - M («j+vj)T+*-, (99) 

j=Af+l 




FIG. 2: Schematic plots illustrating substeps and their counting rules (see text) in the N = 4 case with M = 3 for (++), (+ — ), 

( — +), and ( ) stepping intervals. Thus, for example, from the first plot one confirms Uq"^ = nft. + 1 and, correspondingly, 

that the motor visited the state [0] three times in this realization; and, likewise, + + = n o+ + n 2~i i s i here, equal to 
4 so that, correspondingly, the motor spent time in the substate (1) on four occasions before undergoing the forward major 
transition (3)— »(4 = 0). Similarly, the other plots serve to establish the frequency relations (|76l) - (|85| ). 



for the average number of hidden back-substeps from the 
N-2 hidden intermediate states (M + 2) to (M + N — 1) 
and from the pre-step state (M + N) = (M). We may 
also note the sum rule 



N-i 
i=o 



(-)'(«, +w t )T? 



for N even, 

1 for N odd, 



(100) 



and the special relations 7r_| 



um+nT^ +n and 



7r_ = w M +iTlf +l , so that n^ +1>+ = u M +iT£? +1 and 

n M+N- ~ Wm+N-Lm+N- 

More generally and can be extended to ar- 
bitrary (p, a) if, following the sequence ([TTj) . 7r + and 7r_ 
are replaced by 



^+ = {7T+; 1, 0, 7T++, ^_+; 1, 1, 0, 0}, (101) 
7f_={7r_; 7r+_,7r__,0,l; 0,1,0,1}, (102) 



IV. DISCUSSION AND ILLUSTRATIONS 

First, it is appropriate to look more closely at the dif- 
ference between our present results [see pOjl . BTt . and 
([50)1 - (1521 1 and those of the previously published analy- 
sis [19|, l2Jj- As in Sec. [I] [after (JTTJ)] we use a hacek to 
distinguish the results that presuppose the completion 
of a full enzymatic cycle between all pairs of successive 
major, i.e., observable steps. In the present notation one 
then has (Ml 



1 



r 



JV 



1 + rV 



l + r 



N 



(105) 



for the forward/back fractions while the mean, prior, and 
post dwell times are all given by 



JV-l 



f = f+=f_= ^r n /(l + T N ), (106) 



respectively. Finally, one finds that the mean number 
of forward substeps from the N — 2 hidden intermediate 
states and the post-step state (M + 1) in specified inter- 
vals can be written 



M+2k 

>M + 2 k , + = E + ™;) T f + ^ ( 103 ) 

j=M+l 
M+2fc-l 

< + 2k-i,+ = E (-r^K+^oT-^-- ( 104 ) 

j=M+l 

for k — 1, 2, • ■ • . Some examples of these various expres- 
sions for small N are listed in Appendix A. 



where r„ is defined in (J32J) . On the other hand, when 
allowing for hidden substeps, these three times are all 
distinct: see (j60"l) - (|62]) . 

Then, if one considers the average velocity defined by 

V = d{it + -7T_)/r, (107) 

one finds that the result is the same in both cases, namely, 



N-l 



v = v = d(i-r N ) 



(108) 



as previously [1J, LLSl, l2fj, [2l|, [22| . This is not really sur- 
prising, because the asymptotic ratio of the total distance 
to the total time should not depend on the way one takes 
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into account (or ignores) substeps. Note that Tjv = 1 is 
the condition for stall, when 7r + = 7r_, in both analyses. 

Now let us take a closer look at the ratio of splitting 
fractions, because this is a quantity which one can readily 
obtain from stepping observations to test theory. The 
effect of an external force, F = (F x Fy, F z ), on the rates 
can be expressed in leading order [18l. T23I] as 



m = u°exp{+d8+-F/k B T), 



Wl 



= w^ exp(-d0f -F/k B T), 



(109) 
(110) 



where u®,w® are the load-free rates while the load- 



distribution vectors 6^ and 6~[ serve to specify how the 
27V distinct rates respond to the stress. The periodicity 
of the stepping along the track (which we suppose is in 
the x direction) implies 



JV-l 



E K+On) =e, = (1,0,0). (Ill) 



Rearranging these relations leads to 



JV 



n 



Wr, 



exp 



dF x 
k B T 



n=0 

= exp[-d(F x - F s )/k B T], 
where the stall force is given by [HI, [23| 

k B T 



N-l 

n 

n=0 



IV, 



N-l o ' 



(112) 



(113) 



From (|105p we thus conclude 

ln(7r + /#_) = - lnlV = d(F x - F s )/k B T. (114) 

This is a strong prediction since it asserts that the log- 
arithm of the stepping ratio (7r + /7r_) depends linearly 
on F x with a slope, d/k B T, determined solely by the 
step size d. However, this result depends crucially on 
the assumption that the forward and back steps identi- 
fied correspond to full enzymatic cycles (as the haceks 
indicate). In a typical experiment, however, the hidden 
substeps result in a violation of this assumption as we 
have explained. 

Accordingly, let us, instead, compute the stepping frac- 
tion (7r + /7r_) for which (gDJ) and (gTJ yield 



ln(7r+/7r_) = d*(F x - F s )/k B T, 



(115) 



where, for the sake of comparison with (|114p . we have 
introduced an effective step size 



d* = - kBT In fTN 



F x -F s 



*M.N 



(116) 



In general, this clearly depends on all the rates {uj, uij} 
and hence on the force F x . In the special case Am,n = 0, 
however, d* reduces [via (|114p ] to d. This condition is, in 



fact, realized when TV = 1 since Am,i vanishes identically. 
But an TV = 1 model is unlikely to be adequate. Thus 
in real systems neither the full linearity vs. F x nor the 
equality d* — d are to be expected. In the vicinity of F$, 
however, we can estimate d* by expanding in powers of 
SF X = F X -F S . This yields 



T N =l- dSF x /k B T + 0{5F%), 
A m ,n = A s + A' s 5F x + 0{6F%), 



(117) 
(118) 



where As — Am,n{F x = Fs) while A' s is the correspond- 
ing derivative. Thence we find 



1 



0(SF x ) 



(119) 



It follows from the definition (f3"Tj) that As cannot be 
negative so that, quite generally, one has d* ^ d. 

On the other hand, to be concrete, consider an (TV = 2, 
M = 1) model for which we have 



A s = (w /u )s = {ui/wi) s 
= KK)exp{d[F s -(0 ( ] 



■0o).F s ]/k B T}, (120) 



where the subscript S denotes evaluation at F = Fg = 
(F x = Fs, F y ,F z ) in which F y and F z need not vanish at 
stall [23|. It is evident, that As is not bounded above so 
that d* /d is not bounded below. Indeed, the experiments 
on kinesin of Nishiyama et al. [7| and of Carter and Cross 
on kinesin [l3| lead to the estimates d* ~ 3.2 nm and 
d* ~ 4.0 nm, respectively, whereas the step size is d ~ 
8.2 nm. This clearly indicates the importance of the 
hidden substeps in understanding the stepping fractions 
near stall. 

As another concrete example we quote 



VJqWi 



Uo U Ui 

UIU2U3 U2U3 



W0W1W2 \ 

U0U1U2 J s 



WIW2W3 W2W3 



U3_ 

Ui 3 



(121) 



for an (TV = 4, M = 3) model. From this, however, 
one sees that d* will be close to d if (t«3/w3)s and/or 
(wo/uq)s are sufficiently small. 

More generally it seems likely that both the full cy- 
cle assumption and the hidden substep analysis should 
produce similar results when the rates, um and wm+x, 
for the major transition are slow relative to the substep 
rates. To demonstrate this explicitly let us suppose that 
the latter rates satisfy 



u M = r)UM and w M +i = p M +i, 



(122) 



where rj is small while ilm and wm+i and all the substep 
rates, ui =ui (I ^ M) and wi = wi (I ^ M + 1), are held 
fixed. Then one finds 



n 

7i=l 



UM+n 



r/r M , s for se[i,TV-i], 

f N = T N for s = N, (123) 
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FIG. 3: Plots illustrating different dependencies on the normalized force, F/\Fs\, of (a) the normalized velocity V(F)/V(0); 
(b) the forward splitting fraction ir+; (c) the logarithm of the splitting fraction ratio (iv+/n-); and (d) the ratio of mean 
prior backward to mean total dwell time, t-(F)/t(F). The individual plots, labeled (1) to (5), correspond to (N = 2, M = 1) 
models under scalar loading, F=(F, 0,0), with the selected parameter values presented in Table I. 



where r M;S = f]* =1 (w M+n /u M +n), and similarly, 
via ([51]). 



JV-l 



M,N- 



(124) 



With the aid of these expressions we can rewrite the pre- 
vious results (|40I41|) and (JB2J) as 



TT + = = = = 1 — 7T_, (125) 



1 + T N + 2t)A m ,n 

1 + Tjv 
1 + f at + 2 v A m ,n ' 



(126) 



Evidently to zero order in r], the analyses are equivalent 
as anticipated. 

As seen in earlier investigations that were confined to 
the velocity vs. force relation [lfl [2l[ , a surprisingly wide 



range of behavior under varying loads is displayed even 
by the basic (N = 2)-state models when the rates are sub- 
ject to the exponential force distribution laws embodied 
in (|109«llip [lillll]. This is illustrated in Fig.3(a) which 
depicts the velocity V, normalized by its value V(0) un- 
der zero load, as a function of the imposed load F (sup- 
posed parallel to the a;-axis) normalized by the stall force 
magnitude \Fs\. The labeled plots (1) to (5) in Fig.[3]cor- 
respond to the selected parameter values listed in Table I. 
Note that superstall loads are included (F/\Fs\ < — 1) 
which for the parameter sets (1) and (5) results in only 
a relatively small negative velocity (as uncovered in the 
kinesin experiments of Carter and Cross [H[). Similarly, 
assisting loads (F/\Fs\ > 0) are also covered and for sets 
(2), (4), and (5) result in a saturating or, even, decreasing 
velocity under increasing load. In the substall resistively 
loaded region (0 > F/\Fs\ > —1) convex, concave, in- 
flected, and even nonmonotonic [see parameter set (5)] 
behavior is realized. 
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TABLE I: Parameter values for (N = 2, M = 1) models 
under scalar loading, F = F x , employed in Fig.[3]to illustrate 
different force-dependencies. The last column presents the 
computed values of the effective step size d* . [See (|116[1 .] 





d\F s \/k B T 






ot 


e - e 1 


d*/d 


(1) 


9.2 


10"* 


10"* 


0.5 





0.501 


(2) 


2.5 


10~ 2 


10~ 2 





0.5 


0.966 


(3) 


23 


10" 5 


10" 5 


0.07 


0.43 


0.503 


(4) 


23 


10~ 5 


10" 5 


-0.07 


0.48 


0.113 


(5) 


10 


3.4xl0 -4 


2.5xl0" 3 


-0.1 


0.1 


0.118 



For these different cases the corresponding forward 
stepping fractions tt+(F) and the logarithmic ratios of 
forward/back steps, (tt + /tt^), are depicted in Figs. 3(b) 
and (c). [Note that in these figures only the resisting 
range of force, F < 0, is displayed.] Although the vari- 
ation is always monotonically increasing with F (and 
7r + = 7r_ = | when F = Fs), a wide range of forms 
is evident. In the logarithmic plot, Fig. 3(c), one sees lin- 
ear, concave, and inflected variation close to stall. Fur- 
thermore, the value of the effective step size d* varies 
markedly: see the last column in Table I. 

By using the explicit expressions in the Appendix the 

nature of other statistical observables such as ir^ (F), 

etc., is readily explored. Experiments often measure the 
overall mean dwell time, t(F), between steps. Under 
assisting loads (F > 0), when 7r_ is negligible, t(F) di- 
rectly mirrors the reciprocal of the velocity V(F); but, in 
view of the factor (tt+ — 7r_) in (|107|> . it varies somewhat 
differently under resisting loads. More interesting is the 
behavior of the partial dwell time t_ (F) observed prior to 
back steps. This is shown in Fig. 3(d) normalized by the 
overall dwell time t(F) = tt+t + + 7t_t_ [sec ©]. Even 
though, the ratio t_/t is confined to the range (0,2) 
beyond superstall (since t_/t < l/7r_ and tt^ > ^ for 
V < 0), striking nonmonotonic and inflected variation 
arises. 

Needless-to-say, many more plots exhibiting unex- 
pected and surprising behavior can be generated; but 
further exploration seems most useful in connection 
with specific experimental data. Such applications are 
planned. 



V. MULTIPLE OBSERVED TRANSITIONS 

In the previous sections we have derived expressions 
only for the case of a single major transition in each en- 
zymatic cycle; that, indeed, is the typical situation for 
experiments on conventional kinesin [f| d, 0, EH ■ How- 
ever, our results can be generalized to the case in which 
several substeps are sufficiently large to be clearly de- 
tected, while others remain hidden in the noise. Suppose 
there are K visible substeps of (average) magnitudes d-A, 
ds, ■ ■ ■ , together totalling 



that occur between states (Mj) and (Mj + 1) with, in 
sequence, 



< M A < M B < ■ ■ ■ < M K < N - 1. 



(128) 



Then between states (Mj_i + 1) and (Mj) there are Mj — 
hidden states and, consequently, all these states 
belong (within the noise) to what we may call the same 
mechanical level, J: see Fig. 2] 

Now one can count all forward and backward de- 
tectable transitions in a long run. Accordingly, let n J ++ 
be the number of pairs of observed substeps that enter 
the mechanical level J via a 4- transition, i.e., a step 
(J — 1), and leave via a + transition taking a step J, and 
similarly for n ,r _ + , and n ,] __. Then, as previously, 

we can estimate pairwise splitting fractions for level J 
via 



+ + — 1 _ tt J 

n± , + n±_ 



j 



n J _ + + n J __ 



= 1 - 7T" 



(129) 
(130) 



Likewise, we can introduce pair- wise dwell times, T J pa , for 
each mechanical level J as the mean times spent between 
a p and a substep into and out of that level. 

With these definitions we may adopt the same ap- 
proach used in Sec. IIIII by setting absorbing boundaries 
at states (L) = (Af/_i) and (R) = (Mj + 1) around 
each level J and studying the appropriate first-passage 
processes. If we set 



Nj= \M A -M B \, \M B -M C \, ■•• , 
\M K -M K _ X \, \N-M K +M A \, 
for J = A, B, ■ ■ ■ , K - 1, K, 



(131) 



we can then conclude, using the previous results (|37l38p . 
that 



'per 



1 pa 



(M, 



7-1 



N 



J-i 



(132) 



Similarly, recalling the definitions (|48l49p for T+(M,iV) 
and T~(M, N) and the results (|54H57j) . we obtain 



t (Mj^,Nj_ x ). 



(133) 



IK 



(127) 



The counting of individual hidden substeps developed 
in Sec. IIII El can be carried forward to obtain the mean 
number of substep transitions between two specified ma- 
jor transitions. The results (|98ti 104|> essentially apply di- 
rectly with M Mj_i and N => 

In terms of the conditional individual state dwell times 
introduced in (|66H68[) we also have 

Mj 

r/ CT = Y, n°{Mj-x,Nj- X ). (134) 

fc=Mj_i+l 

as a measure of the observable mean overall time spent 
in the mechanical level J subject to the (p, a) conditions. 
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FIG. 4: Schematic depiction of individual biochemical states, (i), organized into mechanical levels, A, B, ■ ■ • , J— 1, J, • • ■ when 
there are K > 1 detectable substeps or major transitions with substep J going forward from state (Mj) to state (Mj + 1). 



As regards potential applications of these results, the 
case K = 2 may be reasonable for a first analysis of data 
for myosin V where, as mentioned, a significant observ- 
able substep was originally predicted [19| and later ob- 
served however, the experiments also suggest [Tol [TT| 
that stepping may proceed through two (or more) alter- 
native pathways so that a purely sequential model (to 
which our attention has been restricted) may be inade- 
quate [26j |. For the Fi-ATPase motor [28, 2^, H2 substeps 
have also been reported and occasional back steps have 
been observed. Thus our results should be applicable. 



VI. SUMMARY AND CONCLUSION 

As explained in the Introduction, the need to develop 
the hidden substep analysis we have presented arises 
from the fact that experimentally detectable steps in 
the motion of a motor protein along its track do not 
necessarily delineate the completion of full biochemical 
enzymatic cycles. As a consequence, previous analy- 
ses that addressed such observable statistics as back- 
stepping fractions, 7r_ , and mean dwell times, t + and 
t_, measured prior to forward (or +) and back (or — ) 
steps, were not adequate to relate the underlying rates 
in a biomechanochemical model, say Ui and Wj to the 
experimental data. 

We have considered the basic A-state sequential ki- 
netic model set out in (JTJ) and specified by N forward 
rates Ui from biochemical state (i) and N reverse rates 
uij from state (j). In general, the rates depend on 
the concentration of various reagents [see, e.g., ©] and, 
in particular, vary experimentally with the load force 



F = (F x , F y , F z ): see (I109I11UI) . 

The basic problem may then be set up by supposing 
that as the motor progresses (or retrogresses) along its 
molecular track only a single "major transition" from 
state (M) to (M + 1) (0 < M < A), or its reverse, 
corresponds to a "visible" or detectable "step" in the N- 
state cycle. All the other transitions are "hidden": see 
Fig. [TJ [Cases in which more than one major transition 
or observable (sub)step occur in each full enzymatic cy- 
cle are analyzed in Sec. [V]] It then transpires that two 
crucial combinations of the rates {ui,Wj} play a central 
role, namely, Tjy and Am,n as defined in (j2"!)|) - (p?l~j) . 

Indeed, explicit expressions for the forward and back- 
ward stepping fractions, n + and 7r_, are derived in terms 
of Tat and Am, at in Sec. IIII Bl and presented in (|4TJ|) and 
([4"Tj) . It proves helpful, furthermore, to relate ir + and 
7r_ to the conditional or pairwise step probabilities, 7r ++ , 
7r + _, etc., for + steps followed by a + step, or by a — 
step, etc., that can be defined via counting observations, 
as explained in (fT5|) - P^|) and the associated text. These 
pairwise probabilities are likewise expressible in terms of 
T N and A m ,n- see J37J and ([38]). 

From these results one can see - as explained in further 
detail in Sec. lIVI - that only when Am.n «C min{l, T^} 
can one neglect the hidden substeps without risk of se- 
rious error. Particularly instructive is the variation of 
ln(7r + /7r_) as the load F x passes through stall (at which 
7r + = 7r_ = i so that the mean velocity vanishes). One 
may then define an effective step length, d*, via Q115P 
and (|116|) . When Am,n —* (or if hidden substeps are 
ignored) one has the simple equality d* = d, where d is 
the full step size of the motor (per cycle). But, in fact, 
d*/d must always be less than unity and, as seen in ex- 



15 



periment and illustrated in Table I, this ratio is typically 
of magnitude only 0.3 to 0.5. 

Going beyond simple counts of backward and forward 
steps, one may also define conditional mean dwell times, 
t ++ , T_| , etc., for the time spent between a pair of suc- 
cessive + steps, or a + step followed by a — step, etc.: 
see (TT21 . These pairwise mean times can likewise be cal- 
culated [see (|54l) to (|57[) ] in terms of individual- state post 
dwell times, T+(= if*) and Tr(= Tr°), that represent 
the mean time spent in a state (j) following a + or — 
step, respectively: see Sec. IIIIDl for a fuller explanation 
of the notation, etc. The corresponding explicit expres- 
sions, (gH|) and (03), involve the basic rate products T; im 
and their sums, A/, n , as again defined in (|29|) and (|3Tj) . 
The final results for t+, t_, and for the overall mean 
dwell time r (between + or — steps) for general M and 
N entail slightly simpler sums: see (j59")) - ([6"2"j) . 

More transparent formulae for the stepping fractions 
and dwell times for N — 2 models (involving only the 
rates uq, u\, wq, and wi,) and for selected N = 4 models, 
are presented in Appendix A. In addition, the parts of 
Fig. [3] and the associated discussion in Sec.|lVl illustrate 
that a wide range of different types of behavior of n + (F), 
]n[ir+(F)/ir-(F)], and T-(F) as functions of load can be 
realized even within simple N — 2 models. 

At a higher level of detail, conditional individual-state 
dwell times, T++, T+~, ■ ■ • , can be derived [see (l66|) -(l68l)] 
and, likewise, post (as against the previously mentioned 
prior) dwell times, r +<> and r_ : see (|64[) . Finally, one can 
obtain the expressions ([98]). (|99| . (fT03| . and (fl04|) . for 
the mean number, n^ a + and n^ a _ of hidden, forwards and 
backwards, substeps from an individual state (j) that oc- 
cur in a time interval between detectable steps, i.e., ma- 
jor transitions specified by {p, a} — {■ ■ ■ , H — , — +, • • • }: 
see (ITO . These results provide quantitative estimates for 
the number of "lost" or "hidden" transitions occurring in 
an enzymatic cycle. Such information could be of partic- 
ular interest for real motor proteins since, when operating 
in cells to achieve mitosis, etc., they may frequently be in 
close-to-stall conditions where reverse substeps are likely 
to be most frequent 0, |46| . 

In conclusion, we have provided a detailed analysis of 
the statistics of mechanochemical transitions that must 
be hidden in the experimental noise when a motor pro- 
tein on its track moves processively via distinct steps, or 
reaches stall. As experimental resolution at the microsec- 
ond and nanometer scales improves, we can expect that 
such analyses will be increasingly valuable for extracting 
reliable inferences about motor mechanisms from obser- 
vational data. 
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APPENDIX A: EXPRESSIONS FOR 
TWO-STATE AND FOUR-STATE MODELS 

For convenience of reference we provide here explicit 
expressions for N = 2 models with M = 1. First, we 
recall the full-cycle expressions [34| 



7r+ = 1 — 7T_ = 



U Ul 



U Ui + W Wi 



(Al) 



UQ + Ui + Wq + Wl 

T+=T-=T= ■ , (A2) 

UqU\ + WqWi 



the result for general N being given in (|105|) . (1106p . Al- 
lowing for hidden substeps leads to 



7T + = 1 — 7T_ = 



Ui(u + Wq) 



UqU\ + WqW\ + 2li 1 U>o ' 



(A3) 



while the distinct prior dwell (or stepping) times are 
given by 



(u + Wq) 2 + Mq(ui + Wl) 
(U + W )(lL Ui + W Wi + Ml W ) 

(Ml + Wi) 2 + W 1 (u + Wq) 

{ui + wi)(u ui + wqi«i + UiWq) 

with the mean overall dwell time 

u + Ui + Wo + Wi 
UqUi + WqWi + 2uiWq 



'+ = 'o+ 



T = 7T + T + + 7T_T_ = 



■ (A4) 
, (A5) 

(A6) 



For general N see (j40|), (JSJ), and ([60j) -([62 ]> . 

The partial or conditional pairwise step probabilities 
follow, by specializing (|3"T|) and (|3"5)) . as 



7T i i = 1 - 7T i _ 



U Ul 



UqU\ + WqWi + UlWQ ' 



n l = 1 — 7r = 



ui(u + w ) 



(A7) 



(A8) 



u ui + wqwi + mwo 
The denominators here, say 

ID 2 = u ui + wqwi + uiwq, (A9) 



should be contrasted with those in (|A3|1 and (|A6|) . 
namely, 

ID J = 1D2 + uiwo — uqU\ + WqWi + 2uiwo- (A10) 

At the next level of individual (or partial) (sub)state 
properties, the individual or substate dwell times follow 
from (|59p . which yields 



T = (ui + Wi)/ID+ T 1 = (u +w )/^t, (AH) 
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which, in accord with (J62J), satisfy r = Tq + T\. The 
partial substate dwell times, given generally in (|48I49[) . 

are 

T+ = (u x + u>i)/2> 2 , T x + = wo/S 2 , (A12) 

To = wx/®2, T- = (u + w )/S 2 . (A13) 

Finally, the conditional or pairwise mean dwell or step- 
ping times are 

t++ = r__ = (u +u 1 +w +wi)/£>2, (A14) 
T+- = [("l + wji) 2 +u w 1 ]/{u 1 + wi)D 2 , (A15) 
t-+ = [("o + wq) 2 + u Q wi}/(u + w Q )'£i 2 . (A16) 

The mean numbers of hidden transitions follow from 
the results in Sec- IXTlEl For the simplest N = 2 (M = 1) 
case we obtain 

„++ 



notation 



- 1 


= n o,+ = 

= n t>7+ = 


— h — O 

n o,+ — n o,+ — n o,+ 
ri^~^_ = n^Z = ^T- 






= n l- - 


1 = nfZ. = nft_ = "o"i/£> 2 , 


(A17) 


,+<> 

'0,+ 


= <V = 


u ("i + w 1 )/D 2 , 


(A18) 


c 


=<- = 


Wx(u + w )/D 2 , 


(A19) 


,<x> 

'0,+ 


= "o("i - 




(A20) 




= wi(«o - 




(A21) 



To provide further insight we quote some results for 
N = 4 models with M = 3, i.e., with the major step as 
the last transition from state (3) to [4 = 0]. Thus we 
have 

tt_ = w Q m Q /T>i, 7T+ = ii 3 9t 3 /S+, (A22) 

r = (m + m 1 + m 2 + m 3 )/®+, (A23) 

where, to write the numerator and denominator contri- 
butions compactly, we introduce the short-hand product 



"0,1 = "0"l, 1/0,1,2 = "o"i"2, ■ • • , 

w ,i = w wi, w , 1,2 = w wiw 2 , • • ■ , (A24) 
etc. Then we have 

% = "1,2,3 + "l"2,3 + "l,2"3 + "l,2,3, (A25) 

% = "2,3,0 + "?2"3,0 + "2,3"0 + "2, 3,0, (A26) 

% = "3,0,1 + "3"0,1 + W3.QUX + 103,0,1, (A27) 

% = "0,1,2 + "o"i,2 + w .iu 2 + w ,i,2, (A28) 

while the denominator is given by 

D| = U'o% + "3^3 = "0,1,2,3 + 2w "l,2,3 

+ 2w Q . iu 2 . 3 + 2iu ,i, 2"3 + "0,1,2,3- (A29) 
For the purpose of comparison we quote the result for 
the velocity, namely, 

V = d(«o,i,a,3 - "o,i,2,3)/(% + % + 9t 2 + m 3 ), (A30) 



which, of course, is not simply d/r: see f|107[) . Finally, 
then we also quote 

t++ = t__ = [m + m 1 + m 2 + y\ 3 

+ "o"3("i + wi + u 2 + w 2 )]/£> 4 , (A31) 



and 



"2,1.0% + "2,lDl9tl + "25)2% + ®3% , * on , 
T+ = — ■ — — — , (Aoz) 



©4% 



where 



®JV = "0,1,-,JV-1 + "o"l,2,---,AT-l 

+ "o,i"2,3,--- ,n-i H 1- "o,i, ■■■ ,jv-i, (A33) 

from which r_ follows by using 7t_t_ + 7T+T+ = r. Of 
course, results for A/ = 0, 1, and 2 and for 7r ++ , r ++ , Tj, 
etc., follow from the expressions derived in Sec. IIIIl 
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